A toolkit for optimising parallel performance
نویسندگان
چکیده
Three interacting tools to assist distributed memory programmers in developing, optimising and understanding application performance have been developed. These tools perform automatic code generation from an initial workload speciication, performance prediction using memory hierarchy simulation, and performance visualisation for distributed memory message passing applications. Their combination facilitates extensive performance tuning from initial workload speciication through to completed message passing program codes.
منابع مشابه
Performance Measurement and Modeling with the Lost Cycles Toolkit
Although there are many situations in which a model of application performance is valuable, performance modeling of parallel programs is not commonplace, largely because of the diiculty of developing accurate models of real applications executing on real multiprocessors. This paper describes a toolkit for performance tuning and prediction based on lost cycles analysis. Lost cycles analysis deco...
متن کاملA Linear Algebra Formulation for Optimising Replication in Data Parallel Programs
In this paper, we present an efficient technique for optimising data replication under the data parallel programming model. We propose a precise mathematical representation for data replication which allows handling replication as an explicit, separate stage in the parallel data placement problem. This representation takes the form of an invertible mapping. We argue that this property is key to...
متن کاملBenchmarking and Optimising Software for Improved Multiprocessor Performance
Parallel computers have become widely available with the introduction of multicore processors, however software will need to be optimised to make use of the additional processors. A number of tools can aid the process of optimising code for parallel execution, such as parallel debuggers, profilers and parallel libraries. This paper describes an approach to evaluating the effectiveness of a sele...
متن کاملA toolkit for parallel image processing
In this paper, we present the design and implementation of a parallel image processing software library (the Parallel Image Processing Toolkit). The Toolkit not only supplies a rich set of image processing routines, it is designed principally as an extensible framework containing generalized parallel computational kernels to support image processing. Users can easily add their own image process...
متن کاملThe semi-automatic parallelisation of scientific application codes using a computer aided parallelisation toolkit
The shared-memory programming model is a very effective way to achieve parallelism on shared memory parallel computers. Historically, the lack of a programming standard for using directives and the rather limited performance due to scalability have affected the take-up of this programming model approach. Significant progress has been made in hardware and software technologies, as a result the p...
متن کامل